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1. Introduction 

Stochastic flows of diffeomorphisms have been a subject of much research [4, 6, 13, 15]. 
In this paper, we are interested in an important subclass of such flows, namely the 
Brownian flows of diffeomorphisms (cf. [15]). Our goal is to study small noise asymptotics, 
specifically, the large deviation principle (LDP) for such flows. 

Elementary examples of Brownian flows are those constructed by solving finite dimen- 
sional Ito stochastic differential equations. More precisely, suppose b, ft, i = 1, . . . , m, are 
functions from M.^ x [0,T] to M'* that are continuous in {x,t) and {k -\- l)-times continu- 
ously differentiable (with uniformly bounded derivatives) in x. Let /3i, . . . , be indepen- 
dent standard real Brownian motions on some filtered probability space (fl,J-,¥,{J-t}). 
Then for each s £ [0, T] and x G K'', there is a unique continuous { J'tj-adapted, R'^-valued 
process (a;), s <t <T, satisfying 

^s,t{x)=x+ b{<j>sAx),r)dr + Y, M<l>s,r{x),r)dl3,{r). (1.1) 

J s 1 J s 



This is an electronic reprint of the original article published by the ISI/BS in Bernoulli, 
2010, Vol. 16, No. 1, 234-257. This reprint differs from the original in pagination and 
typographic detail. 



1350-7265 © 2010 ISI/BS 



LDP for stochastic flows 



235 



By choosing a suitable modification, < s < t < T} defines a Brownian flow of C*^- 

diffeomorphisms (see Section 2). In particular, denoting by the topological group of 
C'^'-diffcomorphisms (see Section 3 for precise definitions of the topology and the metric 
on G*^), one has that </> = {^o,*: < t < T} is a random variable with values in the Polish 
space Wk = C([0,T] -.G^). For e g (0,oo), when fi is replaced by efi in (1.1), we write 
the corresponding flow as 0"^. Large deviations for 0^ in Wk^ as e — >■ 0, have been studied 
for the case fc = in [3, 18] and for general k in [5]. 

As is well known (cf. [4, 15, 16]), not all Brownian flows can be expressed as in (1.1) 
and in general one needs infinitely many Brownian motions to obtain a stochastic differ- 
ential equation (SDE) representation for the fiow. Indeed typical space-time stochastic 
models with a realistic correlation structure in the spatial parameter naturally lead to 
a formulation with infinitely many Brownian motions. One such example is given in 
Section 5. Thus, following Kunita's [15] notation for stochastic integration with respect 
to semimartingalcs with a spatial parameter, the study of general Brownian flows of 
C'^-diffeomorphisms leads to SDEs of the form 

d^sA^) = F{4>sAx)^^i)^ (l>sAx)=x, 0<s<t<T,xeR^ (1.2) 

where F{x, t) is a C^^^^ -Brownian motion (sec Definition 2.2). Note that such an F can be 
regarded as a random variable with values in the Polish space Wk = C{[0, T] : C'^+^(M'')), 
where C'''+i(R'^) is the space of (fc + l)-timcs continuously diffcrcntiablc functions from R 
to K''. Representations of such Brownian motions in terms of infinitely many independent 
standard real Brownian motions is well known (see, e.g., Kunita [15], Exercise 3.2.10). 
Indeed, one can represent F as 

F{x,t)= b{x,r)dr + y2 Mx,r)d(3,ir), {x,t) eR'' x [0,T], (1.3) 
Jo ,^^Jq 

where is an infinite sequence of i.i.d. real Brownian motions and &, fi arc suitable 

functions from x [0,T] to (see below Definition 2.2 for details). 

Letting a{x,y,t) = X^i^i fi{xit)fi{y^i) for x,y € R'', t € [0,r], the functions (a, 6) are 
referred to as the local characteristics of the Brownian motion F . When equation (1.2) 
is driven by the Brownian motion i^^ with local characteristics {sa,b), we will denote 
the corresponding solution by (p^ . In this work we establish a large deviation principle 
for {(j)'^,F^) in Wk-i x Wk-i- Note that the LDP is established in a larger space than 
the one in which {(ff ,F^) take values (namely, Wk x Wk)- This is consistent with results 
in [3, 5, 18], which consider stochastic flows driven by only finitely many real Brownian 
motions. The main technical difficulty in establishing the LDP in Wk x Wk is the proof of 
a result analogous to Proposition 4.10, which establishes tightness of certain controlled 
processes, when A: — 1 is replaced by k. 

As noted above, the stochastic dynamical systems considered in this work are driven 
by an infinite dimensional Brownian motion. A broadly applicable approach to the study 
of large deviations for such systems, based on variational representations for function- 
als of infinite-dimensional Brownian motions, has been developed in [7, 8]. Several au- 
thors have adopted this approach to analyze the large deviation properties of a variety 
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of models, including stochastic PDEs with random dynamic boundary conditions [22], 
stochastic Navier-Stokes equations [21] and infinite-dimensional SDEs with non-Lipschitz 
coefficients [19, 20]. The approach is a particularly attractive alternative to standard 
discretization/approximation methods (cf. [2]) when the state spaces are non-standard 
function spaces, such as the space of diffeomorphisms used in the present paper. 

The proof of our main result (Theorem 3.2) proceeds by verification of a general suffi- 
cient condition obtained in [8] (see Assumption 2 and Theorem 6 therein; see also Theo- 
rem 3.6 of the current paper). The verification of this condition essentially translates into 
establishing weak convergence of certain stochastic fiows defined via controlled analogues 
of the original model (see Theorem 3.5). These weak convergence proofs proceed by first 
establishing convergence for A'^-point motions of the fiow and then using Sobolev and 
Rellich-Kondrachov embedding theorems (see the proof of Proposition 4.10) to argue 
tightness and convergence as fiows. The key point here is that the estimates needed in 
the proofs are precisely those that have been developed in [15] for general qualitative 
analysis (e.g., existence, uniqueness) of the uncontrolled versions of the fiows. Unlike 
in [3, 5, 18] (which consider only finite-dimensional flows), the proof of the LDP docs 
not require any exponential probability estimates or discretization/approximation of the 
original model. 

In Section 5 of this paper we study an application of these results to a problem in 
image analysis. Stochastic diffcomorphic flows have been suggested for modeling prior 
statistical distributions on the space of possible images/targets of interest in the study 
of nonlinear inverse problems in this field (see [12] and references therein). Along with 
a data model, noise corrupted observations with such a prior distribution can then be 
used to compute a posterior distribution on this space, the "mode" of which yields an 
estimate of the true image underlying the observations. Motivated by such a Bayesian 
procedure, a variational approach to this image matching problem has been suggested 
and analyzed in [12]. A goal of the current paper is to develop a rigorous asymptotic 
theory that relates standard stochastic Bayesian formulations of this problem, in the 
small noise limit, with the deterministic variational approach taken in [12]. This is done 
in Theorem 5.1 of Section 5. 

Large deviations for finite-dimensional stochastic flows were studied for asymp- 
totic analysis of small noise finite-dimensional anticipative SDEs in [18] and of finite- 
dimensional diffusions generated by eLi + L2, where Li,L2 are two second-order dif- 
ferential operators, in [5]. Analogous problems for infinite-dimensional models can be 
treated using the large deviation principle established in the current paper. 

We now give an outline of the paper. Section 2 contains some background definitions 
of C'^-Brownian motions and Brownian flows. Section 3 presents the main large deviation 
result of the paper. The key weak convergence needed to prove this result. Theorem 3.5, 
is given in Section 4. Finally, Section 5 introduces the image analysis problem and uses 
the results of Section 3 to obtain an asymptotic result relating the Bayesian formulation 
of the problem with the deterministic variational approach of [12]. 

We generally follow the notation of [15]. A list of standard notational conventions is 
given at the end of the paper, but specialized notation is as follows: 
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• Let a = (ai, a2, . . . , ttd) be a multiindex of non-negative integers and \a\ = ai + 
0:2 + • ■ ■ + Qfd- For an |a|-times difFerentiable function / : R'^ — M, set 9"/ = 9"/ = 

{ax^ri---(dxar<i ■ ^^^^ ■^^^ ^"^"^ ^ ^if- If / = (A' /2> • • ■ , /d)' is an |a|- 
times differentiablc function from R'' to R"*, we write d"f = (i9"/i, 9"/2, . . . , fd)' ■ 
By convention d'^f^f. 

• For TO > denote by C™ the space of m-tinies continuously differentiablc functions 
from to R. 

• For any subset A C R'', to > and / e C", let 



J2 sup|9"/(x)|. 



The space C™ is a Frechct space with the countable collection of scminorms 
||/||m:A„,^n = {x '■ \x\ < n} . In particular, it is a Polish space with a topology that 
corresponds to "uniform convergence on compacts" . 

• For < (5 < 1, let 

II f I! • llfll + V ..n 

U\\m,S:A=V\\rn:A+ 2^ SUp 1 _ , , 

and 

f^m.S ^ ^ . _5.^^ < 00 for any n G N}. 

The scminorms {|| • \\m.5-A„,n E N} make C'"'* a Frechct space. 

• For jTi > denote by C™ the space of functions 5 : M"^ x R"^ — > R such that g{x,y), 
x,y £ R'' is m-times continuously differentiablc with respect to both x and y. En- 
dowed with the scminorms 



\\9\\m;A„ 



^ sup \d^d^g{x,y)l 



0<\a\<m 



where n G N, C" is a Frechct space. Also, for < (5 < 1 let 



where A-^^^.giy) = d^^yg{x,y) - d^, yg{x' ,y), d^^yg{x,y) = d^g{x,y). Then 

C™^* - {g G C™; |ig|i,7,,,-;A„ < for any n G N} 

is a Frechct space with respect to the scminorms {|| • 5 /i , £ N}. 

We write ||/|i™;R<i as ||/||m- The norms || • || • ||~, || • H^^^ arc to be interpreted 

in a similar manner. 
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. Let C^iR'') = If = ifu /2, . . . , fdYj e C™, * = 1, 2, . . . ,4 and ||/!|,„ = ^ti WMU 
The spaces C™'''(R''), C™(M'*''''), C™'''(M'*'''')^and their corresponding norms are de- 
fined similarly. In particular, note that h e C'"'''(R''^'') is a map from R'' X R"^ to 

• Let C^'^R'^) and C^'*(R''^'^) be the classes of measurable functions b:[0,T] 
C"''^{R'^) and a:[0,T] C""-^ {R'^'"^), respectively, such that 

||&||T,m.<5= sup \\b{t)\\m.5 < CO and 

0<t<T 

l|a||T,m,5= sup \\a{t)\\~ s <oo. 

0<t<T 

• We denote the Hilbert space 

^2 = % {xi,X2, . . .) : € R, i > 1 and a^i < oo , 

where the inner product on I2 is defined as {x, y)ir^ = Yl^i ^iUit a;, y G ^2- We denote 
the corresponding norm as || • \\i^. 

• Given a filtered probability space {ft,J-,P,{J-t}), we define 

Aih] = j-/- = {(f>t}Zi 'P^ ■ [0, T] R is { J'J-prcdictable for all i 
andpj^ l|0(s)||2^ds<c^|=l|. 

• Define 



5jv[y = |<A = {</>,L^iGiv2([o,r]:;2) s.t. 



{s)\\lds<N 



Endowed with the weak topology for the Hilbert space L^([0, T] : /^), S'7v[^2] is a 
compact Polish space. 
• Define 

-4^ [^2] ^{u^ A[l2] ■■ u{uj) e S'tv, P-a.s.}. 

We will always consider iS'Ar[^2] with the weak topology when referring to convergence 
in distribution of S'Ar[^2] -valued random variables. 



2. Preliminaries 



Let o denote the composition of maps and let id denote the identity map on R'^. 
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Definition 2.1 {Stochastic flows of homeomorphisms/diffeomorphisms) . A 

collection {0s^t(a;) :0 < s <t <T,x G Mf^} of -valued random variables on some filtered 
probability space (fl,T,P,{Tt}) 'is called a forward stochastic flow of homeomorphisms if 
there exists N Q T with P(A^) = such that for any lu £ A^'^; 

1. {s,t,x) i-^ (j)s^t{x,i-u) is a continuous map, 

2. 0s_ti(aj) ~ (j)t,u{^) ° 4>s.t{'^) holds for all s,t,u, < s <t<u<T, 

3. 4>s^s{^) = id for all s, < s <T, 

4. the map 0s,t(w) :R'* — > M"* is an onto homeomorphism for all s,t,0 < s < t <T . 

If in addition is k-times differ entiable with respect to x for all s <t and the 

derivatives are continuous in {s,t,x), it is called a stochastic flow of C'^ -diffeomorphisms. 

We now introduce a Brownian motion with a spatial parameter, with local characteris- 
tics (a,&). Throughout Sections 2-4, we will assume that (a,&) e C^'''(E''^'') x C^'''(M''), 
for some fc <£ N and 6 E (0,1]. Fix v such that < v < 5. 

Definition 2.2 {'C^'^ -Brownian motion). A continuous stochastic process {F(t)}t>o 
on some filtered probability space (rj, J",P,{-Ft}) with values in ^.^'•'{W^) is said to be 
a C^''^ -Brownian motion with local characteristics {a,b) if F{0), F{ti^i) ~ F{ti),i = 
0,l,...,n — 1, are independent C''"'^ {Mf^) -valued random variables whenever < to < 
ti < ■ ■ ■ < tn < T and if for each x G Mf^, M{x, t) ^ F{x, t) — b{x, r) dr is a contin- 
uous (d- dimensional) martingale such that {{AI{x,-),M{y,-)))t ~ J^a{x,y,r)dr for all 

The existence of a C'^'''^-Brownian motion with local characteristics (a, b) follows from 
[15] (see, e.g., Theorem 3.1.2 and Exercise 3.2.10). Indeed, for any j < S one can represent 
F as in (1.3), where fi-.W^x [0, T] M'* arc such that for each t e [0, T], /,(•, t) e C''''''(R'^), 

OO 

a{x,y,t)^^f^{x,t)f-{y,t), a.e. 
1=1 

and 

/ ^|/,(x,r)pdr<T||a||?.fe.,<oo. 
•^0 ,=1 

In particular, note that if is a C'^^'^-valued Brownian motion, its finite-dimensional 
restriction (F{xi,-), F{x2, ■),■■■, F^Xn,-))' is an nd-dimensional Brownian motion (with 
suitable mean and covariance) for any {xi, . . . ,Xn) € K""*. If F is as defined by (1.3) 
and {0t}o<t<i is a continuous IR''-valued {J^tj-adapted stochastic process, the stochastic 
integral j^F^cj)^, dr) is a well-defined d-dimensional continuous {J^t}-adapted stochastic 
process (see [15], Chapter 3, Section 2, pages 71-86). 

Definition 2.3. Let F be as in Definition 2.2. Then for each s € [0,T] and x G W^, 
there is a unique continuous Tt-adapted, M.''' -valued process (t)s,t{x), s <t <T satisfying 
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4>s,t(x) = x + J* F{(j)s^r{x), dr),t£ [s,T]. This stochastic process is called the solution of 
Ito's stochastic differential equation based on the Brownian motion F. 

From [15], Theorem 4.6.5, it follows that {(t>s^t}o<s<t<T as introduced in Definition 2.3 
has a modification that is a forward stochastic flow of C'^-diffcomorphisms. 

3. Large deviation principle 

Given £ > 0, let i^^ be a C'^''^-Brownian motion on some filtered probability space {fl,J-, 
P, with local characteristics (ea,6), where (fc,J^) and (a, 6) are as in Section 2. 

Without loss of generality we assume that F'^ is represented as 

F'{x,t)^ b{x,r)dr + V^y2 fi{x,r)d(3i{r), G M'^ x [0,T], (3.1) 

Jo Jo 

where (f3i,fi)i>i are as in Section 2. Note in particular that 

i^^(x,t)- / b{x,r)dr^y/eM{x,t). 
Jo 

With an abuse of notation, when e = e„ we write F'^ as F". Observe that {{M{x, •), /3;(-)))t = 
J^fi{x,r)dr for all < e [0,T], a.s. Let 0= = {<j>lt{x),0 < s <t<T,xeR'^} be the forward 
stochastic flow of C'^-diffeomorphisms based on F^ . With another abuse of notation, we 
write 0g J as c/if and ^i''' = (a;), < t < T, x € R'*}. 

The goal of this paper is to show that the family (cj)'^ , F^)^yo satisfies an LDP on a 
suitable function space, as e 0. We begin by recalling the definition of a rate function. 

Definition 3.1. Let £ be a Polish space. A function /:£"—> [0,cxj] is called a rate func- 
tion on £, if for each M < oo the level set {x € £:I{x) < M} is a compact subset of 
£. 

We remark that some authors define a rate function by replacing the requirement of 
compactness of level sets by the requirement that these level sets be closed and refer to 
a rate function satisfying the property of compact level sets as a "good rate function" . 

For m e N, let G™ be the group of C'"-diffeomorphisms on M.'^. G™ is endowed with 
the metric 

d™(0,^) = A™(0,7/.) + A™(0-\V"'), (3.2) 

where 

A^(0,^)= J2 p(5"</'>^"V'), 

|q:| <m 

(3.3) 
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N-. 



- 2~l + sup|,|<^|(/.(x)-V'(x)|' 



Under this metric G™ is a Polish space. Let W„, = C([0,T]:G'") be the set of all 
continuous maps from [0,T] to G™ and = G([0, T] : C"(R'')) be the set of all 
continuous maps from [0,r] to C™(K''). The space Wm endowed with the metric 
— ^'^Po<t<T ^m{'t'{t),'4'{t)) ^'Hd the space Wm with the metric (i,„((/), '0) — 
suPq<j<7^ Am(0(t), are Polish spaces. Note that {(j)^,F^) belongs to Wk x Wk Q 
Wk-i X Wk-i C Wk-i X Wk-i- We win show that the pair (0^,F^)e>o satisfies LDPs 
in both of the spaces Wk-i x Wk-i and Wk-i x Wk-i, with a rate function / that is 
introduced below. 

Let u= {iti}^j G UAr>i -^A^I^a]- Given any such control, we want to construct a cor- 
responding controlled fiow in the form of a perturbed analogue of (3.1). Observe that 
Zt = X^i^i /o ''^li^) d/3i(s) is a continuous square integrable martingale. For any 7 < 5 one 
can find : M'' x [0, T] x ^ R'^ such that 6„ (t , w) € C^-^ (R'*) for a.e. (i , w) , such that for 
each X S R"*, •) is predictable, and such that bu{x, s) ds = {{Z, M{x, ■)))t for each 
(x,t) e R'^ X [0,T]. In particular, for each x G R'',6„(x,t) = ^'■e- it,uj). 

Furthermore, for some cG (0,oo), 

00 

||6u(t)|ll^<c||a||?.fe.5^|HKt)P, [dt®P]-a.e. in(i,a.). (3.4) 

1=1 

The proofs of these statements follow along the lines of Exercise 3.2.10 and Lemma 3.2.3 
of [15]. Next, define 

F^'"{x,t)= [ bu{x,s)ds+ [ b{x,s)ds. (3.5) 
Jo Jo 

It follows that F*''"(-,<) is a C'^''''(R'*)-valued continuous adapted stochastic process. Let 
bu = bu + b and for {to, x) G [0, T] x R"* let {<Pto^ti^)}to<t<T be the unique solution of the 
equation 



4:Zt {x)^x+ bu {4>tol (x) , dr, t€[to,T]. (3.6) 

Jto 

From [15], Theorem 4.6.5, it follows that {C't",0 < s<t<T} is a forward flow of C - 
diffeomorphisms . 

For (</>°,F0) G Wk X Wk define 

/(0°,i^°)= mf ir|j^(.)||?^ds, (3.7) 

u£C(4>o,Fo} I Jo 

where C{(j)^,F°) = {w G ^^([o, T] : ^2) | (0",i^°) = (f^"'", i^°'")}. Note in particular that u 
in (3.7) is deterministic. If (0°, -F°) G (W^fc-i x l^fc-i) \ (Wk x W^fc) then we set I{(I>°,F°) = 
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oo. We denote the restriction of / to Wk-i x Wk-i by the same symbol. The following 
is the main result of the section. 

Theorem 3.2 (Large deviation principle). The family (0'^,F^)£>o satisfies an LDP 
in the spaces Wk-i x Wk-i and Wk-i x Wk-i with rate function I . 

Let {un}'^^i{un = {"""li^i) be a sequence in y^jv[^2] for some fixed N <oo. Let {e„}„>o 
be a sequence such that e„ > for each n and e„ — as n oo. Note that we allow 
e„ = for all n. RecaU that M{x,t) = J^Zi So M^^"^) d/3»(?-), ix,t) e R'' x [0,T]. Define 

F"{x,t)^ f buAx,r)dr + ^,M{x,t) (3.8) 
Jo 

and let 0" be the solution to 

</)r(x)=x+/ buA€ix),r)dr + ^J M {cj,'; {x) , dr) . (3.9) 
Jo Jo 

Clearly F" € Wk, and from [15], Theorem 4.6.5, equation (3.9) has a unique solution 
(/)" S Wk a.s. We next introduce some basic weak convergence definitions. 

Definition 3.3. Let u e ^^[^2] and {(/)"} he as above. Let ^^k-i^^l-i measures 
induced by (0",/'"), (f/)"^", i^°'"), respectively, on Wk-i x Wk-i- Thus for A G B{Wk-i x 
VFfc_i), 

P^_i(A) = P((^",F") € A), PLi(^) P(('?!>°'", i^"'") € ^)- 

The sequence {(</>", ^^")}n>i *s said to converge weakly as G^^^ -flows to {(fP'^ , F^'"^) as 
n — > 00 i/ converges weakly to ^l-i '^'^ n — > 00. 

Definition 3.4. Let ¥]^_^,Fl_^ he the measures induced by (</)", F"), (0°-", F°'"), re- 
spectively, on Wk-i X Wk-i- The sequence {((/)", F")}„>i is said to converge weakly as 
C''^^ -flows to {(//'•'^ , F^'") as n —> 00 i/P^_j converges weakly to P^-i as 71— >-oo. 

As noted in the Introduction, proofs of large deviations properties based on the general 
framework developed in [8] essentially reduce to weak convergence questions for controlled 
analogues of the original process. For our problem the following theorem gives the needed 
result. The proof is given in the next section. 

Theorem 3.5. Let converge to u in distribution as an SN[h]-'>'0'lued sequence of 

random variables. Then the sequence {((/)", F")}„>i converges weakly as -flows and 
G^^^ -flows to the pair (0°'",^°'") as n—>- 00. 

The following theorem is taken from [8] . Let M°° denote the product space of countably 
many copies of the real line. Then S = C([0, T] : R°°) (with the usual topology) is a Polish 
space and /? = is an 5*- valued random variable. 
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Theorem 3.6. Let £ he a Polish space, let {5^}e>o he a collection of measurable maps 
from {S,B{S)) to {£,B{£)), and letX^ = Q^{y/ej3). Suppose that there exists a measurable 
map - S ^ 8 such that for every N < oo the set Tn = {^/°(/o u{s) ds) : u S S'7v[^2]} is 
a compact subset of £ . For f e£ let Cf = {ue L'^{[0,T]:P) : f = G^iJ^Usds)}. Then L 
defined by 

is a rate function on £. Furthermore, suppose that for all N < oo and families 
{u'^} C -4Ar[Z2] such that converges in distribution to some u G ^^[^2]. voe have 
that t/^(-\/£/3(') + /o^'^('S)ds) — > it(s) ds) in distribution as e— > 0. Then the fam- 

ily {X^ ,£ > 0} satisfies the LDP on £ , as e — > 0, with rate function L . 

Proof of Theorem 3.2. We will only show that the sequence {(jf ,F^) satisfies an LDP 
in Wk-i X Wk-i with rate function / defined as in (3.7). The LDP in Wk-i x Wk-i 
follows similarly. Let : S Wk-i x Wk-i be a measurable map such that Q^{-\/e(3) = 
(0^, F'^) a.s., where F^ is given by (3.1) and (/)^ is the associated flow based on F"^ . Define 
g^:S^Wk-i X Wk-i by g°{J^u[s)ds) = {(jP ,F^) li u € L'^{[Q,T]:l2) and with <iP,F° 
as deflned in (3.6) and (3.5), respectively. We set Q^{f) = for all other f £ S. 

Fix N < 00 and consider — {G'^{Jq u{s) ds), w G Sn[12\}- We first show that Fjv is a 
compact subset of Wk^i x Wk-i- It suffices to show that if m„,m G Sn[12\ arc such that 
Un — >■ u, then fj"(/p u„(s) ds) — >■ 5°(/pM(s)ds) in Wk-i x Wk-i - This is immediate from 
Theorem 3.5 on noting that g°{J^ u„(s) ds) = (0", F"), where (^", F" are as in (3.9) and 
(3.8), respectively, with e„ = and g^^{J^u{s)ds) = (0°'",^°^"), where 0O,«^^o,« 
in (3.6) and (3.5), respectively. 

Next let {un} C y^Ar[/2] and e„ G (0,oo) be such that e„ — > and u„ converges in 
distribution to some u as n — > 00. In order to complete the proof, it is enough, in 
view of Theorem 3.6 and the definition (3.7), to show that G^" {y^f^ + /gW„(s)ds) — !• 
t/''(/p ii(s) ds) in Wk-i X Wk-i, as n— J- 00. An application of Girsanov's theorem shows 
that e^^"(Ve;r,3 + /oU„(s)ds) = (0",F"), where are defined as in (3.9) and (3.8), 

respectively. Also G*^ {Jqu{s) ds) — (1^°'", F°'"), where (^"'"^i^O-" are the same as in (3.6) 
and (3.5), respectively. The result now follows from Theorem 3.5. □ 

4. Proof of Theorem 3.5 

This section will present the proof of Theorem 3.5. It is worth recalling the assumptions 
that will be in effect for this section, which are that {«„} is converging to u in distribution 
as an S'Ar[^2]-valued sequence of random variables and that (a, b) G C^-'^(R''^'^) x C^''*(R'^), 
for some fc G N and S £ (0,1]. 

We begin by introducing the (m+p)-point motion of the fiow and the related notion of 
"convergence as diffusions" . Let x = (xi , X2 , . . . , Xm ) and y = (?/i , j/2 7 • . . , 2/p ) be arbitrary 



244 



A. Budhiraja, P. Dupuis and V. Maroulas 



fixed points in K'^x™ and R'^-^p, respectively. Set 



0r(x) = (</>r(xi),0r(^2),...,</>r(^™)) 

and 

F" (y , t ) = (F" (y 1 , t ) , F" (2/2 , i ) , . . . , (2/p , i ) ) ■ 

Then the pair {0"(x), F"(y, t)} is a continuous stochastic process with values in R'^^™ x 
M'^^P and is called an (m+p)-point motion of the flow. Let Vm = C([0, T] : R'*^™) be the 
Frcchct space of all continuous maps from [0,T] to K'^^™, with the usual seminorms, and 
let Vm,p = VmXVp he the product space. 



Definition 4-1- Let P"x y) ^'^'^ ^(x y) measures induced by (0"(x), F"(y)) and 

(0O'"(x),F"^"(y)), respectively, onV^^p. Thus for A e B{V„,,p) , 

Pf,,^) =P((0"(x),F"(y)) e A), P^-^^y) =P((0O'"(x),F"^«(y)) e A). 

The sequence {(0",F")}„>i is said to converge weakly as diffusions to ((/j*^'", F"'") as 
n — > cxj j/P"^ converges weakly to P°^ as n —J' oo /or each (x, y) e M^^^"' x R''^p, awd 
m,_p= 1,2,.... 



The following well-known result (cf. [15], Theorem 5.1.1) is a key ingredient to the 
proof of Theorem 3.5. 

Theorem 4.2. The family of probability measures f"^_^(respectively, PJJ^j) converges 
weakly to probability measures V^_^{respectively, as n ^ go if and only if the 

following two conditions are satisfied: 

1. the sequence {(0",F")}„>i converges weakly as diffusions to ((/)°'", F'^^") as n — > c», 

2. the sequence {P^^_i} {respectively, {¥^_i}) is tight. 

We will show flrst that under the condition of Theorem 3.5 the sequence {(</>", F")}„>i 
converges weakly as diffusions to (^^'^jF"'") as n — !• oo. We begin with the following 
lemma. 



Lemma 4.3. For each a; £ M'^ 



. sup 

0<t<T 



V / fk{x,s)dPk{s) 

/ fk{ci>:{x),s)dPk{s) 



supE sup 

n 0<t<T 



< OO, 



< 00. 



(4.1) 
(4.2) 
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Proof. We will only prove (4.2). The proof of (4.1) follows in a similar manner. From 
the Biirkholdcr-Davis-Gundy inequality the left-hand side of (4.2) is bounded by 



ciE 



= ciE 


i 







Tr (a ((/)r (x) , 0r (a;) , r) ) dr 



< C2|la||?, 



fc,<5- 



The last expression is finite since a belongs to (R'^^'^). 

An immediate consequence of Lemma 4.3 is the following corollary (cf (3.8), (3.9)). 
Corollary 4.4. For each x eR'^ and t G [0,T], 



□ 



F"(x,0= / hu„{x,r)Ar + Sn{x,t) 
Jo 



and 



h'^{x),r) dr + T„ix,t), 



where Sn{x,-) and T„(a;, •) are continuous stochastic processes with values in R"^ , sa<is- 
fying supQ^f^j^{\Sn{x,t)\ + \Tn{x,t)\} in probability asn^oo. 

The following lemma, showing the tightness of P"^ , plays an important role in the 
proof of the weak convergence as diffusions. 

Lemma 4.5. For each x eR"^ the sequence {{(/)" (x), F"{x))}n>i is tight in C{[0,T] ■.R'^ x 

TU>d\ 



Proof. We will only argue the tightness of {(/)"(x)}. Tightness of {F'^{x)} is proved 
similarly. Corollary 4.4 yields that T'„(x, •) is tight in C([0, T] : R''). Thus it suffices to show 
the tightness of {Jg 6^^ (0"(a;), r) dr}. Fix p> 0. From the Cauchy-Schwarz inequality, 
(3.4) and recalling that u„ €.4Ar[Z2], E| J^* (0"(a;), r) drj^* can be bounded by 



E 



1P/2 



|6„„(0:?(a;),r)pdr 



<C2it^sr^^ 



the result follows. 



□ 



Proposition 4.6. Let Un^u in distribution as S M[l2]-'valued random variables. Then 
the sequence {((/)", -F")}„>i converges weakly as diffusions to (0'^'", i^*^'") as n^oo. 
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Proof. We claim that it suffices to show that for each t E [0,r], the map 



(4.3) 



from C([0, T] : M'') x Sn[12] to R'' is continuous. To see the claim, note that in view of the 
tightness established in Lemma 4.5, the proposition will follow if any weak limit point 



b,F,u) of {(j)"-,F",Un) satisfies, for each fixed {t,x) e [0,T] 



F{x,t)= / bu{x,r)dr, (/)t(a;)=.T+ / bu{(f)rix),r) dr 



(4.4) 



Now fix a weakly convergent subsequence and {t,x) £ [0,T] x R**. From (4.3) (0"(x), 
/o ^«r, (2;): ^) d?') converges weakly in R'^ x R'^ to {4>t{x), /g bu{(t>r{x),r) dr). The second 
equality in (4.4) is now an immediate consequence of the second equality in Corollary 4.4. 
The first equality in (4.4) is proved similarly on noting that (4.3) in particular implies 
that the map u i— > bv{x, s) ds, from S'jv[^2] to R'', is continuous. 

We now prove (4.3). Let (^„,u„) {£.,v) in C{[0,T]:R'^) x SNih]- Then, 



ibvAC.s)-U^s,s))ds 



< 



ibvAC.s)-bvMs,s))ds 



ibv„i^s,s) -by{^s,s))ds 
L1 + L2. 



(4.5) 



For each a: <E R'' we have that 
t 

{by^ {x, s) — by{x, s)) ds 
since w„ — J- u weakly in L^([0, T] : I2) and 



00 „( 

/ fi{x,s){vf{s)^vi{s))ds 

"'0 



(4.6) 



Furthermore from (3.4) (recall fc > 1) we have that for some ci e (0,cx)) and all x,y G 
R'^,0<t<T, 



(6„„(a;,s) - by^{y,s))ds 



<ci\x- y\ 



(4.7) 



Using the Ascoli-Arzela theorem (in the spatial variable) and equation (4.6), (4.7) now 
yields that the expression on the left-hand side of (4.6) converges to uniformly for x 
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in compact subsets of R''. Thus L2 — J' as n — ?> 00. Following similar arguments Li is 
bounded by C2 supg<s<7^ — £,s\, which converges to as n 00. Hence (4.5) converges 
to as n — >■ cx) and the result follows. □ 

We next show the tightness of the family of probability measures {P^.j^}. Key ingre- 
dients in the proof are the following uniform L^'-estimatcs on d°'F'^{x,t) and d°'(j)^{x). 

Lemma 4.7. For each p>l there exists ki G (0, 00) such that for all t, t' G [0, T],x G W^, 
n>l and \a\ < k: 

E\d°'F''{x,t) - d°'F'\x,t')\P < ki\t-t'\P/^. (4.8) 

Proof. Fix a multiindex a such that |a| < fc and p> I. Using the Biirkholder-Davis- 
Gundy inequality for the martingale d°'M{x, •) and the fact that a e C^'*(M''), we obtain 
that for some Ci G (0, 00) and aU x eR''',t,t' e [0,T], 

E\d°'M{x,t) - d"M{x,t')\P <ci\t~ t'\P/^. (4.9) 

Recalling that bu„{-,t) e C''^'^(R'') a.c. {t,uj) and using (3.4) we get 

/ sup |9"6„„(x,r)| dr < 00 a.e., 
Jo xeR'' 

and thus d" bu„ {x, r) dr ~ d^b^^ {x, r) dr a.c. An application of the Cauchy-Schwarz 
inequality and (3.4) now gives, for some C2 G (0,oo), 

<C2\t-t'\P/^. (4.10) 



9" / bu„ix,r)dr 



Equation (4.8) is an immediate consequence of (4.9) and (4.10). □ 



For g-.R''- X [0,T] M'', let Vyg{y,r) be the dx d matrix with entries \V yg{y,r)]i 



-^gi{y-,r). Differentiating with respect to xi in (3.9) we obtain 



di^'^{x)=dix+ / [V^6„„(C(.T),r).aiC(x)]dr 
Jo 

+ VyM{(p'^{x)Ar)-dirr{^) 
Jo 

^dix+ f WyF'\cj)"^{x),dr)-dir^{x). 



By repeated differentiation one obtains the following lemma whose proof follows along 
the lines of Theorem 3.3.3 of [15]. Given < m < fe, let be the set of all multiindices 
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a satisfying \a\ < m. For a multiindex 7, denote by m{'-f) = tt{7o : |7o| < |7|}- Also for a 
|7|-times differentiable function ^':R'^— 7>M, denote by the m(7)-dimensional 

vector with entries d'''°'^{x), I70I < |7|. If ^' = *2, . . . , : R'^ -)> R'^ is such that each 
*i is |7|-tinies continuously differentiable then = . . . , 9-l'^l*d(a;)). 

We will call a map P:M™ — > M'' a polynomial of degree at most p if P{x) = 
{Pi{x), . . . ,Pd{x)y and each PitM™ — > M is a polynomial of degree at most p. Also for 
u,v we define 

u*v = {uivi,. . .,um)' . 



Lemma 4.8. Let a, 13,^ be multiindices such that \a\,\P\,\j\ < k. Then there exist 
subsets Al^,A^ 0/ A|q,| and A\a\-i, respectively, a subset ^ 0/ A|^| and polynomials 
P^,^ : R™(''') ^ R'^ of degree at most \a\, such that 9^0" satisfies: 



^° , (4.11) 

where for x, y G R"*, G"(x, y, r) = VyF"" {y, r) ■ x and for (x, y) e R"('') x R'', G^;"(a;, y, r) = 
Pl,{x)*dlF^{y,r). 

Note in particular that in the third term on the right-hand side of (4.11), one finds 
partial derivatives of (/)"(a;) of order strictly less than |a|. 

Lemma 4.9. For each p>l,LE (0,oo), there is a constant ki = ki{k,p,L) g (0,oo) 
such that for every multiindex a, |a| < k, 

sup sup E sup \d°'(j)'^{x)\P < ki (4.12) 

n |2;|<L 0<t<T 

sup sup E|a"<?!)J'(a;)-a"<?!)J',(a;)|P < ki\t-t'\P/\ (4.13) 

" |x|<L 

Proof. Fix i > and consider x gM.'^ such that |a;| < L. We will first show inequality 
(4.12). It suffices to prove (4.12) for a = and establish that if, for some m < fc, it holds 
for with |a| < m and all p > 1 then it also holds for di with all p > 1 (with a 

possibly larger constant ki) and i = 1, . . . , d. The desired result then follows by induction. 

Consider first a = 0. For this case the bound in (4.12) follows immediately on us- 
ing (3.4) and applying the Biirkholder-Davis-Gundy inequality to the square-integrable 
martingale Nt = /„* A/(0;; (x), dr) [note that {{N))t = /g a((/);?(x), (x), r) dr and aS 
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Now, suppose that (4.12) holds for all multiindices a with \a\ < m, for some m < k. 
Fix a with \a\ < m, an z G {1, 2, . . . , d}, and consider the multiindex 5 = a + li, where Ij 
is a d-dimensional vector with 1 in the ith entry and elsewhere. From Lemma 4.8, one 
finds that d°'(j)'^ solves (4.11) for a = a. Note that for /3 € Ai , 

d^F^iy, t) = f d^buiy, s) ds + V^a^M(y, t). 



From (3.4) and recalling that (5, a) e C^' (M'') 
ci,C2 e (0,oo), 



we have that for some 



sup sup 

0<t<Ty£Md 



d^buiy,s)ds <ci and sup sup \{{d^M{y,t)))t\ <C2. 

0<t<T, - ■ 



This along with the assumption 



sup sup E sup \d''cl)'^{x)\P < ki for i^, \i^\ < \a\ 

n \x\<L 0<t<T 



shows that for some C3 G (0,oo), for all (/3,7) £ A\ x A 



sup sup E sup 

n \x\<L 0<t<T 



<C3. 



In a similar manner one has for some C4 G (0, 00) 

p 



E sup / G"(a"(/);?(x),0;?(a;),dr) 

0<t<T Jo 

Combining the above inequalities we obtain 



<C4 / E sup \d°'(t)';ix)\P)ds. 

Jo ^0<r<s 



sup sup E sup |a"0^(a;)|P<C3 + C4Sup sup / E sup Id" cj)"^ {x)\p ] ds . 

n \x\<L 0<s<t n \x\<lJo ^0<r<s ' 

Now an application of Gronwall's lemma shows that for some C5 G (0,oo), 

sup sup E sup |9"0^(a;)|P<C5. 

n \x\<L 0<t<T 

This establishes (4.12) for all a with |a| < |a| + l. Finally consider (4.13). For t,t' G [0,T], 
t' < t, we have from (4.11) that 



5"</.r(2^) -5"0r'(-^) = / G"(a"C(^),C(3;), dr) 



E fG;-;{d^\''\<j^^{x),<i^^ix),dr). 



(4.14) 



(/3,7)6A^xA2 
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Using (4.12) on the right-hand side of (4.14) we now have (4.13) via an application of 
Holder's and Biirkholder-Davis-Gundy's inequalities. □ 

The proof of Theorem 3.5 proceeds along the lines of Section 5.4 of [15]. We begin by 
introducing certain Sobolev spaces. Let j be a non-negative integer and let 1 < p < oo. 
Let Bn = 5(0, N) be the R'^-ball with center the origin and radius N. Let h:R'' ^R'' 
be a function such that the distributional derivative d^h E LP{B]^) for all a such that 
\a\ < j. Define 




The space i?j°p = {/iiffi.'' — > K'', ||/i|jj.p:7v < oo for all N} together with the scminorms 
defined above is a real separable semireflexive Frechet space. By Sobolcv's imbed- 
ding theorem, we have Hj°^j^p C C-'XR'^) C ^^]°p ii p > d. Furthermore the imbedding 
i : Hj°^^ p — > C^ (M'*) is a compact operator by the Rellich-Kondrachov theorem (see [1]). 

Proposition 4.10. The sequence {(0", i^")}„>i is tight in Wk-i x Wk-i 

Proof. It suffices to show that both {(j>"}n>i and {-F'"}n>i are tight in Wk-i- We will 
use Kolniogorov's tightness criterion [15], Theorem 1.4.7, page 38. From Lemmas 4.7 
and 4.9; we have that for each p> 1,N > 1, there exist ci,C2 € (0,oo) such that for all 

[O^T] 

supE||0^-'^^'lllp:^^<cl|^-^'l^/^ 

n 

snpE\\F-{;t)-F-{-X)\\i,..j,<C2\t-t'r/'. 

n 

Furthermore, since F"(-,0) = and <f>Q{x) = x, we get that for each p > 1, A'' > 1 there 
exist C3,C4 € (0,oo) such that 

supEUnip:N<C3 and supE||/'"(t)|lP < C4. 

n n 

Applying Theorem 1.4.7 of [15] with p> 2 now gives a tightness of {0"}„>i and {F"}n>i 
in the semi- weak topology on C([0, T] : (cf. [15]). Since the imbedding map i : H^^ — >■ 

C'""""'^ is compact, tightness in Wk-i x Wk-i with the topology introduced in Section 2 
follows (see [15], pages 246-247). □ 

Recall the definitions (3.2) and (3.3). For the proof of the following lemma we refer 
the reader to Section 2.1 of [4]. 

Lemma 4.11. Letfn,fGWk-i be such that sup^^f^rp Xk^i{fn{t), f{t)) 0, as n—>- 00. 
Then supo<j<j. (ifc_i(/„(i), /(t)) -> 0. 



LDP for stochastic flows 



251 



Proof of Theorem 3.5. Convergence as C'^~^-flows is immediate from Theorem 4.2, 
Propositions 4.6 and 4.10. Using Skorokhod's representation theorem, one can find a 
sequence of pairs {((/i", F")}„>i that has the same distribution as {(i^", F")}„>i and 
{{(iP,F^)} that has the same distribution as and supo<t<j'[Afe(0", 0j ) + 

Afc(/'"(t),F°(t))] 0, a.s. Since (j)",(jp € Wk a.s., the same holds for ^",(^°. Thus from 
Lemma 4.11 supo<t<i. 4-^1 (0^ 0?) ^ a.s. Hence (</>", F") ^ (<?!)°, F") as G'^-i-flows. □ 

5. Application to image analysis 

A common approach to image matching problems (sec [12, 14, 17] and references therein) 
is to consider a M^'-valued, continuous and bounded function T(-), referred to as the "tem- 
plate" function, defined on a bounded open set O C R-^, which represents some canonical 
example of a structure of interest. By considering all possible smooth transformations 
h:0 O one can generate a rich library of targets (or images) given by the form T{h{-)). 

In typical situations we are given data generated by an a priori unknown function 
h, and the key question of image matching is that of estimating h from the observed 
data. A Bayesian approach to this problem requires a prior distribution on the space 
of transformations and a formulation of a noise/data model. The "maximum" of the 
posterior distribution on the space of transformations given the data can then be used 
as an estimate h for the underlying unknown transformation h. In certain applications 
(e.g., medical diagnosis), the goal is to obtain numerical approximations for certain key 
structures present in the image, such as volumes of subrcgions, curvatures and surface 
areas. If the prior distribution on the transformations (and in particular the estimated 
transformation) is on the space of diffcomorphisms, then this information can be recov- 
ered from the template. Motivated by such a Bayesian approach, a variational problem 
on the space of C™-diffeomorphic flows was formulated and analyzed in [12]. 

Before going into the description of this variational problem, we note that although 
the chief motivation for the variation problem studied in [12] came from Bayesian consid- 
erations, no rigorous results on relationships between the two formulations (variational 
and Bayesian) were established. The goal of our study is to develop a rigorous asymp- 
totic theory that connects a Bayesian formulation for such an image matching problem 
with the variational approach taken in [12]. The precise result that we will establish is 
Theorem 5.1, given at the end of this section. The result is an application of Theorem 
3.2 for local characteristics (a, b) = (a, 0) and a £ <C^'^^^{M!^^^), with k — m — 2. 

Let C^{0) be the space of infinitely differentiable, real- valued functions on O with 
compact support in O. The starting point of the variational formulation is a differential 
operator L on [C^(C')]"^; the exact form of which is determined from specific features of 
the problem under study. The formulation, particularly for problems from biology, often 
uses principles from physics and continuum mechanics as a guide in the selection of L. 
We refer the reader to Christenscn et al. [9, 10], where natural choices of L in shape 
models from anatomy are provided. 
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Define the norm j] • |1l on [C^{0)]^ by 

ll/lli = E / \iLfUu)\'du, 
Jo 



lo 

where we write a function g € [C^(C)]'^ as 32, ffa)'- It is assumed that generates 
an inner product on [C^(C')]^ and that the Hilbert space H defined as the closure of 
[C5"(C')]'^ with this inner product is separable. We will need the functions in H to have 
sufficient regularity and thus assume that the norm || • \\l dominates an appropriate 
Sobolcv norm. More precisely, let W™'''^'^(C') be the closure of C^(C') with respect to 
the norm 



j E la-ffHI'du)'^', geCo"(0), (5.1) 



where a denotes a multiindex and m > 3. Define Vm = [W^™^^'^(C')]'^'^, where (8) is used 
to denote the usual tensor product of Hilbert spaces. We denote by || • ||v,„ the norm on 
Vrn- The main regularity condition on L is the following domination requirement on the 
II • ||l norm. There exists a constant c S (0,oo) such that 

||/||l>c||/||v„, forall/G[Co°°(0)]3. 

This condition ensures that H C C™'i/2(0) (see [1], Theorem 4.12, parts II and III, page 
85). We denote by H the Hilbert space ^^([o, 1]:H). For a fixed deH let {r]s,t{x)}s<t<i 
be the unique solution of the ordinary differential equation 

^^^=i?(7?,,t(x),<), V.Ax)=x, 0<.s<t<l. (5.2) 

Then it follows that {r/s^t, 0< s <t<T} is a forward flow of C^-diffcomorphisms on O 
(see [15], Theorem 4.6.5, page 173). Since i?(-,t) has a compact support in O, one can 
extend rjs^t to all of M'^ by setting 77s, t (a;) =x, if a; g O^. Extended in this way rjs^t can be 
considered as an element of G™, as defined in Section 3. Denoting 770,1 by h^, we can now 
generate a family of smooth transformations (diffcomorphisms) on O by varying 1? G "H. 
Specifically, the library of transformations that is used in the variational formulation of 
the image matching problem is {h^ \ 1? € H}. 

We now describe the data that is used in selecting the transformation h^* for 
which the image T{h^*{-)) best matches the data. Let £ be a finite index set 
and {Xijig/; be a collection of disjoint subsets of O such that Uig£^i = '-'Ol- 
lected data {di}i^c represents integrated responses over each of the subsets Xi,i € 
C. More precisely, if T(h(-)) was the true underlying image and the data were 
completely error free and noiseless, then di = Jj^ T{h{a))da/vol{Xi), iGC, where 
vol denotes volume. Let d = {di,d2, ■ ■ ■ jdn)' , where ?z = |£|. Defining Yd{x) = di, 
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a; e Xi, i g C, the expression 

i / \T{h^{x))-Yd{x)\''dx 
^ Jo 

is a measure of discrepancy between a candidate target image T{h^{-)) and the obser- 
vations. This suggests a natural variational criterion for selecting the "best" transfor- 
mation matching the data. The objective function that is minimized in the variational 
formulation of the image matching problem is a sum of two terms, the first reflecting the 
"likelihood" of the transformation or change-of- variable and the second measuring the 
conformity of the transformed template with the observed data. More precisely, define 

JdW = \ {\\nu + \T{h^{x)) - dx^ . (5.3) 

Then e argmin^g^ Jdi'ff) represents the "optimal" velocity field that matches the data 
d and for which the h^* , obtained by solving (5.2), gives the "optimal" transformation. 
This transformation then yields an estimate of the target image as T(ft,^.(-)). Equiva- 
lently, defining for each /i S , 

Jd(/i)= inf Jdi-ff) {where ^h^{^eH:h^h4), (5.4) 

we see that an optimal transformation is h* ~ h^* G argmin^ Jd{h). 

Up to a relabeling of the time variable, the above variational formulation (in particular 
the cost function in (5.3)) was motivated in [12] through Bayesian considerations, but no 
rigorous justification was provided. In [12] the orientation of time is consistent with the 
change-of- variable evolving toward the identity mapping at the terminal time. To relate 
the variational problem to stochastic flows it is more convenient to have the identity 
mapping at time zero. Wc next introduce a stochastic Bayesian formulation of the image 
matching problem and describe the precise asymptotic result that we will establish. 

Let be a complete orthonormal system in H and /3 = (/3i),^i be as in Section 
2, a sequence of independent, standard, real-valued Brownian motions on some flltered 
probability space (17, J-,P, {J-J). Write (C([0,r] :R°°),e(C([0,r] :M°°))) = (5,5), and 
note that /3 is a random variable with values in S. Consider the stochastic flow 

oo 

dV's.t(a;) = Vi^0.(V's.t(a;))d/?,(t), ijs,s{x)=x, xeO,0<s<t<l, (5.5) 

i=l 

where e € (0, oo) is fixed. From Maurin's theorem (see [1], Theorem 6.61, page 202) it fol- 
lows that the imbedding map H — )■ Vm-2 is Hilbert-Schmidt. Also, Vm-2 is continuously 
embedded in C™-2'V2(0). Thus for some ki,k2 G (0, oo) and all u,x,y G O, 



oo oo 
i=l i=l 
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^ - Uy)\' <h\x- ^k2\x^ y\\ 

i=l 4=1 

One also has that if is extended to all of R'^ by setting (j)i{u) = 0, for all x E C^, then 

a{x, y) = J2Zi MxWiiy) is in C^'^'^^^R^''^). Thus it follows (cf. [15], pages 80 and 
106) that 

OO „t 

1=1 -^0 

is a C"'~^'''-Brownian motion, < i/ < 1/2, with local characteristics (a,0). Also (5.5) 
admits a unique solution {ip^f.{x),0 < s < t < 1} for each x £ O and {V's t}o<s<t<i is 
a forward flow of C'^-diffeomorphisms, with k = m — 2 (see [15], Theorem 4.6.5). In 
particular, X'^ == V'o i is 8- random variable in the space of C'^-diffeomorpshisms on O. 
The law of (for a fixed e > 0) on G'^ will be used as the prior distribution on the 
transformation space G^. Note that T{X^{-)) induces a measure on the space of target 
images. 

We next consider the data model. Let C and n be as introduced below (5.2). We 
suppose that the data is given through an additive Gaussian noise model: 

A = / T(X"(x))dx + ^/i^„ 

where G £} is a family of independent, p-dimensional standard normal random 
variables. 

In the Bayesian approach to the image matching problem one considers the posterior 
distribution of X^ given the data D and uses the "mode" of this distribution as an 
estimate for the underlying true transformation. More precisely, let {r'^}£>o be a family 
of measurable maps from E"p to V{G'') (the space of probability measures on G''), such 
that 

r'^(^|£') V[X^ e A\D] a.s. for alU G B{G''). 

We refer to r^(-|d) as a regular conditional probability distribution (r.c.p.d.) of X^ given 
D = d. In Theorem 5.1 below, we will show that there is a r.c.p.d. {r'^{-\d),d G R"^}£:>o 
such that for each d £ IR"p, the family {r^{-\d)}e>o, regarded as elements of V{G''~^) 3 
'P{G^), satisfies an LDP with rate function 

Id{h) = Jd{h) - Xd, where = inf Jd{h) = inf Jd{i))- 

hdGi'-^ -den 

Formally writing r'(A|(i) w e^^'''''^/'^ d/i, one sees that for small e, the "mode" of the 
posterior distribution given D — d, which represents the "optimal transformation" in the 
Bayesian formulation, can be formally interpreted as aigmini^Id{h). Note that Jd{h) ~ oo 
if ft, ^ G™ (recall m = fc + 2). Theorem 5.1 in particular says that h G G™ is a 5-minimizer 
for Id{h) if and only if it is also a (5-minimizer for Jd{h). Thus Theorem 5.1 makes precise 
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the asymptotic relationship between the variational and the Bayesian formulation of the 
above image matching problem. 

Theorem 5.1. There exists an r.c.p.d. such that for each d£W^, the family of 
probability measures {T^{d)}e>o on G^~^ satisfies a large deviation principle (as e^Q) 
with rate function 

Id{h)^Jd{h)-\d- (5.6) 
We begin with the following proposition. Let / : G^^^ — s- [0, oo] be defined as 

Proposition 5.2. The family {X^^^y^ satisfies a LDP in G''~^ with rate function L. 

Proposition 5.2 is consistent with results in Section 3 in that although the local char- 
acteristics are in C*^ and E G*^, the LDP is established in the larger space G^~^ . 
This is due to the tightness issues described in the Introduction. Furthermore, as noted 
below (5.2), if WiiWy. < oo then induces a flow of C™-diffeomorphisms on O. Thus if 
h e G*^"^ \ G™ then 4' ft, is empty, and consequently L{h) = oo. Hence there is a further 
widening of the "gap" between the regularity needed for the rate function to be finite 
and the regularity associated with the space in which the LDP is set. This is due to 
the fact that the variational problem is formulated essentially in terms of L^ norms of 
derivatives, while in the theory of stochastic flows as developed in [15] assumptions are 
phrased in terms of L°° norms. 

Proof of Proposition 5.2. From Theorem 3.2 and an application of the contraction 
principle we have that {X^Jg^o satisfies LDP in G'^"^ with rate function 

I*{h)= M I r \\u{s)\\lds, 

where C*{h) = {u e L'^{[0, l]:l2)\h = 0"'"(1)} and where 0°'" is defined via (3.6), but 
with /,; there replaced by (pi- Note that there is a one-to-one correspondence between 
u e L2([o, 1] : I2) and ^ en given as t9(i, x) = J^Zi Mt)Mx) and \\uis)\\l ds = 
In particular u G C*{h) if and only if i9 G '^h. Thus I*{h) = I{h) and the result follows. □ 

Proposition 5.3. For each dGR", Ld defined in [5.6) is a rate function on G^^^ . 

Proof. From (5.6) and the definition of / we have for h E G^^^ that 



Ld{h)^I{h) + ]- I \T{h{x))~Yd{x)\^dx 
^ Jo 



inf <I{h) + l- I \T{h{x))-Ydix)\-'dx 



/leC'-i 1 2 



o 
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From Proposition 5.2, J is a rate function and therefore has compact level sets. Addi- 
tionally T is a continuous and bounded function on O. The result follows. □ 

Proof of Theorem 5.1. We begin by noting that r^(-|(i) defined as 



r^{A\d) = 



-i/(2e)Er=iM-j;c.r(/i(y))dyr 



-l/(2£)Er=i|rf.-/x.r(h(y))dy|' 



At=(d/i)' 



where n'^ = ¥o (X^)^^ g V{G'^~^) is an r.c.p.d.of X'^ given D = d. Using the equivalence 
between the Laplace principle and large deviations principle (see [11], Section 1.2), it 
suffices to show that for all continuous and bounded real functions F on 



elog 







/ cxp 


^-F{v) 




£ 



T%dv\d) 



(5.7) 



converges to \nij^i^Qk-i{F{h) + Id{h)}. Note that (5.7) can be expressed as 
-elog i 

-i/e[i/2Er=iid.-i;,.T(h(s;))dyp], 



elog 



^~l/e[J=^(/i) + l/2Er=ild.-4T(/«(y))dy|^]^e^^^^ 



(5.8) 



Qk-l 



V'(d/i). 



From Proposition 5.2 we see that the first term converges to 



^j^l\m+Fih) + \±d.- j 
\ 1=1 



inf inf <^ F{h) 



1 



inf {F{h) + Uh)}, 



T{h{y))dy 

T{h{y))-Ya{y)\^dy 



o 



where the last equality is a consequence of (5.3) and (5.4). Taking F = in the above 
display, we see that the second term in (5.8) converges to — A^. This proves the result. □ 



Notational conventions 

• Transpose of a d-dimensional vector v will be denoted by v' . 

• K GGW^ means that is a compact subset of Mf^ . 

• Given k- and m-dimensional continuous local martingales M, N on some filtered 
probability space (il, J^, P, {J^t}), we write the cross-quadratic variation of M and 
N as {{M,N))t, and write {{N))t when M = N. This is a continuous M"""'-valued 
{J^t}-adapted process. 
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• Borel CT-fields on a Polish space £ will be written as B{£). 

• Generic constants will be denoted as ci,C2,.... Their values may change from one 
proof to the next. 

• The infimum over an empty set is taken to be oo. 
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